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Quantum dynamics is linear. How do we know? From theory or experiment? The 
history of this question is reviewed. Nonlinear generalizations of quantum mechanics 
have been proposed. They predict small but clear nonlinear effects, which very 
accurate experiments have not seen. Is there a reason in principle why nonlinearity is 
not found? Is it impossible? Does quantum dynamics have to be linear? Attempts to 
prove this have not been decisive, because either their assumptions are not compelling 
or their arguments are not conclusive. The question has been left unsettled. There is 
a simple answer, based on a simple assumption. It was found in two steps separated 
by 44 years. They are steps back to simpler and more compelling assumptions. A 
proof of the assumptions of the Wigner-Bargmann proof has been known since 1962. 
It assumes that the maps of density matrices in time are linear. For this step, it 
is also assumed that density matrices are mapped one-to-one onto density matrices. 
An alternative is to assume that pure states are mapped one-to-one onto pure states 
and that entropy does not decrease. In a step taken in 2006, it is proved that the 
maps of density matrices in time are linear. It is assumed, as in the earlier step, 
that at each time the physical quantities and states are described by the usual linear 
structures of quantum mechanics, so the question is only about how things change 
in time. Beyond that, the proof assumes only that the dynamics does not depend 
on anything outside the system, but must allow the system to be described as part 
of a larger system. 
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I. INTRODUCTION 



Can we prove that quantum dynamics has to be linear? That the Schrodinger equation 
has to be linear? What assumptions are needed? Is there a reason in principle that quantum 
dynamics is linear? If not, quantum dynamics could be a linear approximation of a nonlinear 
theory; experiments might reveal small nonlinear effects. This possibility is too interesting 
to give up easily. We want to see if it can work. We will accept that it is impossible only if 
we see an undeniably convincing reason. 

Wigner and Bargmann proved {2], 3] that quantum dynamics must be linear if it does not 
change absolute values of inner products of state vectors. Bialynicki-Birula and MycielskiJ 
proposed nonlinear Schrodinger equation that inspired precise experimental testsjsl, Q, 



Weinberg 



9] proposed a general nonlinear form of quantum mechanics, which led to more 



experimental tests [10 



11 



12 



131 ]. The seriousness of these proposals and their experimental 



tests shows that the Wigner-Bargmann proof of linearity was not decisive; its assumption 
was not accepted. A more recent idea that relativity requires quantum dynamics to be linear 



HQ 



161 ] has not been conclusive. The question has been left unsettled. This history is 



described in Sections II-IV. 

There is a simple answer, based on a simple assumption. It was found in two steps 
separated by 44 years. They are steps back to simpler and more compelling assumptions. 
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Q,Q,Q 



A proof of the assumptions of the Wigner-Bargmann proof has been known 
since 1962. This earlier step is reviewed in Section V. It assumes that the maps of density 
matrices in time are linear. For this step, it is also assumed that density matrices are mapped 
one-to-one onto density matrices. An alternative is to assume that pure states are mapped 
one-to-one onto pure states and that entropy does not decrease. 

In a step taken in 2006, it is proved that the maps of density matrices in time are linear 
It is assumed, as in the earlier step, that at each time the physical quantities and states 
are described by the usual linear structures of quantum mechanics, so the question is only 
about how things change in time. Beyond that, the proof assumes only that the dynamics 
does not depend on anything outside the system, but must allow the system to be described 
as part of a larger system. The proof is described in Section VII. The assumption is discussed 
in Section VIII. 
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II. PROPOSALS AND TESTS FOR NONLINEAR QUANTUM DYNAMICS 

Is quantum mechanics a linear approximation 
of a more fundamental nonlinear theory? 
With small nonlinear effects not yet seen? 
How might they be seen? 

n 

One possibility, proposed by Bialynicki-Birula and Mycielski[4j], is a nonlinear Schrodinger 



equation 



% d ± = E^= [^V 2 + V — bln{\^)]ip. (2.1) 



The size of the nonlinearity is set by a small positive number b. The In function makes it 
simple to normalize wave functions and to describe a system composed of noninteracting sub- 
systems. It gives the familiar relation between energy and frequency. And there are soliton 
solutions, wave packets that do not spread. Altogether, this is a very attractive candidate 
for a nonlinear quantum mechanics. A proposal [5] to test it with neutron interferometry 



was realized first with a two-slit experiment 



6| that gave 



b < 3.4 x VQT xz eV (2.2) 

and then with Fresnel diffraction that lowered the limit to 

b < 3.3 x l(T 15 eV. (2.3) 

This is wave mechanics pure and simple. It is about how a wave function behaves if it 
satisfies this nonlinear wave equation. 

What makes the dynamics nonlinear is that the Hamiltonian depends on the wave func- 
tion; the Hamiltonian depends on the state. For a wave function of position, this gives a 
nonlinear Schrodinger equation like (12. ip . For a spin described by Pauli matrices S 1; S 2 , S3, 
it means the Hamiltonian can be a matrix like 

# = e(£ 3 >£ 3 . ( 2 - 4 ) 



The Hamiltonian is a matrix that depends on the state, because S3 is a matrix and the 
mean value (E3) depends on the state. The equations of motion for the mean values (Si), 
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(S 2 ), (£3) for this Hamiltonian are the same as for any Hamiltonian that is a number times 
a spin matrix, 

i(E0 = -2e(S 3 )(S 2 ) 
|(S 2 ) = 26<£ 3 ><£i> 

|<E.) - (2.5) 

These equations of motion are nonlinear because the Hamiltonian contains a mean value. 
They describe precession with a frequency that depends on the state. 

This is a representative case of Weinberg's nonlinear quantum mechanics , which 
brings in nonlinear dynamics for spin. Weinberg's framework is very broad. It lets ordinary 
linear quantum mechanics be changed in various ways. For example, the representation of 
physical quantities by linear operators can be changed. I looked at that possibility and 
decided it is not a fruitful way to go; for a spin it gives nothing new [2l|]. I concluded, 
as I think everyone has, that it is more reasonable to assume that at each time everything 
is the same as in ordinary linear quantum mechanics, so the question is just how things 



change in time. Making that assumption, and following a suggestion of Polchinski 22| to 
make Weinberg's theory applicable to a system composed of noninteracting subsystems, I 
showed that Weinberg's theory is just the statement that the Hamiltonian depends on the 
state^that the Hamiltonian, like our example ( 12. 4ft . involves mean values as well as operators 



23l . |24j. Actually, this gives a bit more than Weinberg's theory; it includes some simple 
examples that Weinberg's theory can not describe 24]. 



Weinberg's proposal inspired four precise spectroscopic tests 

HQ HQ- Two looked 
for the effect of a term like (12.4]) in the Hamiltonian. The other two looked for the effect of 
a similar term for spin 3/2. None found a nonlinear effect. They put limits down to 

|e| < 1.6 x l(T 20 eV 

< 2 x 1(T 27 of binding energy p er nucleon (2.6) 

on the size of the nonlinear interaction. 

Are these searches for something that doesn't exist? Is nonlinear quantum dynamics 
impossible? Does quantum dynamics have to be linear? 
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III. THE WIGNER-BARGMANN PROOF OF LINEAR DYNAMICS 

Can we prove that quantum dynamics has to be linear? 
Yes, if we assume that 

pure states <-» pure states 

ip <-> ip' is one to one 
and \(m\ 2 = \WW)?- 

That is what Wiener and Bargmann showed. They proved that the change from 
ip to ip' can be made with an operator that is either linear or antilinear. The product of 
two antilinear operators is linear, so if the change can be made in two similar steps, like the 
change over a time interval that can be split into halves, the operator must be linear. If we 
assume a group property, that the change over a time interval is the change over part of 
the interval followed by the change over the rest of the interval done the same way, and we 
assume that changes of probabilities in time are continuous, we get the linear Schrodinger 



equation 



21 



These proofs were made long before the proposals and tests for nonlinear quantum me- 
chanics. Why were theories developed and experiments done to look for something that had 
been proved impossible? I think the reason is that the assumption of 

KV#>l 2 = KW>l a (3-i) 

is not compelling. What does it say? States represented by ip and (ft at one time go to 
states represented by ip' and <$' at a later time. Given the state represented by ip at the 
earlier time, the probability that if you make a measurement at the earlier time you will 
find the state represented by cf) is the same as the probability that if you wait and make a 
measurement at the later time you will find the state represented by . That's reasonable. 
But does it have to be true? Can you imagine how it might not be true? Bialynicki-Birula 
and Mycielski describe one way. Weinberg describes another. The Hamiltonian depends on 
the state is how I would say it. Why not? 



6 



IV. RELATIVITY? 

In the discussion following Weinberg's proposal and its experimental tests there was a 
new idea. 

Can we prove that quantum dynamics has to be linear? 
Does relativity require it? 

Does nonlinearity imply signals faster than light? 

I have seen three papers that argue it does. NotingQ that two of them Q, 13 do not apply 
to Weinberg's theory, I am interested in only one[l5|. 

It works with the specific example of Weinberg's theory described by the Hamiltonian 
( 12. 4p and equations of motion (12. 5p that we have already seen. The solutions of these 
equations of motion are that (S3) is a constant and 

(E 1 )(t) = (E 1 )(0)cos(2e(E 3 )t)-(E 2 )(0)sin(2e(E 3 )t) 

(E 2 )(i) = (S 2 )(0) COS (2e(E3)t) + (E 1 )(0) S m(2e(E 3 )t). (4.1) 

Consider two different sets of initial conditions: In set (a) 

(E 3 )(0) = ±1. (4.2) 

Then 

(E 1 )(0) = 0=(E 2 )(0), (4.3) 
because the sum of the squares can't be larger than 1, so 

(E 1 )(t) = 0=(E 2 )(t). (4.4) 

The initial conditions (b) are that 



(E 3 )(0) = ±-^ = (E 1 )(0). (4.5) 



Then (E 2 }(0) is and 



I 1 

(E 2 )(t) = ±- r sin(±V2et) = —=sin(V2et). 
a/2 a/2 



(4.6) 



Even for 50 — 50 mixtures of the + and — cases, the result is different for the initial conditions 
(a) and (b). That, it is argued, allows signals faster than light. 
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The argument [ 15| uses two particles with spin 1/2 in the state with total spin 0. The 
particles move away from their common source in different directions to separated locations 
where their spins are measured, one by Alice and one by Bob. This is the much-used set-up 
;hat comes from Bohm's 1951 version of the 1935 argument of Einstein, Podolsky and Rosen 



mm 



< > 



If Alice and Bob measure the components of their spins in the same direction, then when 
Alice gets +, Bob gets — , and when Alice gets — , Bob gets +. This is experimentally 
verifiable; you can watch it happen over and over. 

Suppose now that when a spin comes to Bob, he runs it through the nonlinear dynamics 
described by the Hamiltonian ( 12. 4p for a time t and then measures the component of the 
spin in the y direction. Suppose he does this for each of a long string of spins while Alice 
measures the z component of each spin that comes to her. Alice gets + half the time and — 

n 

half the time, on the average. The argument [15j is that then (£ 3 ) is 1 for half the spins that 
Bob receives and (£3) is —1 for the other half. The spins that come to Bob are described 
by the initial conditions (a). After he runs a spin through the nonlinear dynamics and is 
ready to measure the y component, Bob has a spin in a state where (£2) is 0. He gets + 
half the time and — half the time for the y component. 

Suppose that instead Alice measures the component of each spin in the direction 45° 



between z and x. She gets + half the time and — half the time. The argument [151] is that 
then the spins that come to Bob are + in the 45° direction half the time and — in the 45° 
direction half the time. They are described by the initial conditions (b). After Bob runs a 
spin through the nonlinear dynamics and is ready to measure its y component, he has a spin 
in a state where (£2) is (l/\^2)sin(y2et). He can get + more than half the time or — more 
than half the time, depending on his choice of t. The argument [jjj concludes that Bob can 
tell whether Alice is measuring spin components in the z direction or in the 45° direction. 
Alice can send a signal to Bob. Since there is no restriction on the distances and times, the 
signal can be faster than light. 

It's not necessarily so. If there is not time for a signal at the speed of light between the 
measurements made by Alice and Bob, then moving observers will disagree about which 
measurement happens first. The 50 — 50 probabilities for Alice's results are calculated 
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from the state of total spin for the two spins, without considering what Bob does. The 
probabilities for Bob's results can be calculated the same way, without considering what 
Alice does. Then there is no signal (23 . 



28 



29|]. 



If there is time for a signal at the speed of light between the measurements made by 
Alice and Bob, so Alice can call and tell Bob what is coming, then the states of Bob's spins 
are prepared as the argument describes, and the probabilities for Bob's results are what the 
argument says. Then there is a signal, but it is not faster than light. 
What can be said 



28 



291 ]. and this is where the argument gets some traction, is that in 



the presence of the nonlinear dynamics and the absence of signals faster that light, there 
must be an abrupt change in Bob's results as gradual changes of distances or times change 
the separation between the measurements made by Alice and Bob from timelike to spacelike. 
Is this weird? Yes. Does it prove that this nonlinear dynamics is impossible? No. That 
it does not is shown particularly clearly in a delightful paper written as a fantasy about a 
fictional character who finds a way to detect the changes in Bob's results 30]. 



V. PROOF OF THE WIGNER-BARGMANN ASSUMPTIONS 

There is a simple solution. It was found in two steps separated by 44 years. They are 
steps back to simpler assumptions. A proof of the assumptions of the Wigner-Bargmann 
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201 ] since 1962. A proof of the main assumption used in it 



proof has been known 

HQ 

was found recently We review the earlier step in this section. The recent step is described 
in Section VII and VIII. 



Can we prove that quantum dynamics has to be linear? 

Yes, if we assume, or can prove, 

that the change of density matrices is linear. 

If p — > p' is linear and one to one, 
then pure states pure states 

and|(^)| 2 H(W>| 2 - 



The proof is simple. We assume the dynamics applies to all states. In an interval of 
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time, it maps every density matrix p to a density matrix p' . The assumption that the map 
is linear means that if p\ and p 2 are mapped to p[ and p' 2 , then 

P = PPi + 0--p)P2, (5-1) 
for a number p between and 1, is mapped to 

p' = pp[ + (l-p)p' 2 . (5.2) 

We assume the map is one to one, so it has an inverse. The inverse map is linear; the 



proof that the inverse of a linear operator is linear 25, Theorem 7.1] applies with attention 
restricted to density matrices. 

Pure states are mapped to pure states. A pure state can't go to a mixed state: if p' x and 
p' 2 in Eq. (15.21) are distinct, so are p\ and p 2 in Eq. fl5.ll) ; thus if p' is for a mixed state, so 
is p. The inverse map also takes pure states to pure states, so the set of all pure states is 
mapped one-to-one onto itself. 

For each vector of length 1, let be a vector of length 1 such that is 
l^')^'!- For each density matrix p there are orthonormal vectors and positive numbers 
Pj whose sum Y^jPj is 1 such that 



The linearity implies that 



p = Y,Pj\^j)^j\- ( 5 - 3 ) 



p' = Y,pMM\- ( 5 - 4 ) 



Since {ipj\ip'j} is 1, 



Tr[(p') 2 ] = EPiPM'M^ > Efe) 2 = Tr[p 2 ]. (5.5) 

jk j 



The same result for the inverse implies that 

Tr[(p') 2 ] = Tr[p 2 ]. (5.6) 

Let and \<p) be vectors of length 1 and let 

P=^)<V>l + ~i0>(0l- (5-7) 



Then 



Tr[p 2 }= 1 - + l\(m\ 2 (5-8) 
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and 

p' = \WW\ + \\<t>'){<t>X (5-9) 

so Eq. (15.61 ) implies that 

i<v>w=Kv#>r. (s.io) 

Wigner and Bargmann will take it from there. 

Linear maps of density matrices can also be used to describe processes where pure states 
are mapped to mixed states, different states are mapped to the same state, the map is not 
onto all states or, generally, the map has no inverse that applies to all states. An assumption 
is needed to separate these processes from dynamics described by the Schrodinger equation, 
which has an inverse for all states. For this proof we assumed that the map of density 
matrices is one to one. 

An alternative is to assume that it is the set of all pure states that is mapped one-to-one 
onto itself, instead of the set of all density matrices. Then the inequality (I5.5P is obtained 
as before. If it is assumed that the entropy does not decrease [31|, then 

Tr[(p') 2 ] < Tr[p 2 ]. (5.11) 

This implies Eqs. (15.61) and (15. 10H . 



VI. WHAT IS LINEAR 



The result of quantum dynamics is the time dependence of mean values for Hermitian 
operators representing physical quantities. This includes the time dependence of probabil- 
ities, which are mean values for projection operators. The result is the same whether it is 
obtained from the Schrodinger picture or the Heisenberg picture. The mean value (P) for a 
Hermitian operator P is Tr [Pp] where p is the density matrix that represents the state. In 
the Schrodinger picture, the time derivative of (P) is 

The linearity of the map of density matrices means that for a density matrix p that is a 
mixture described by Eq. (15.11) . the time derivative is 
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so for a mean value 

d(P) = dTrjPp] = dp 
dt dt [ dt 1 

= pTr[P^] + (1 - P)Tr[P^]. (6.3) 

This means that the time derivative of a mean value depends on the state in a linear way. It 
can not be a product of mean values as it is in the equations of motion (12. 5p for our example 
of Weinberg's theory. The time derivative of an operator can be a product of operators; the 
equations of motion in the Heisenberg picture can be nonlinear. But the time derivative of 
a mean value has to be a linear function of mean values jl| ■ 

VII. PROOF OF LINEAR MAPS FOR DENSITY MATRICES 

Can we prove that the change of density matrices is linear? 
Yes, if we assume that the system can coexist with another 
without interaction. 



The proposition that quantum 



32 



dynamics is described most generally by linear maps of 



33j in 1961. Now we have a proof of it. This is the recent 



density matrices was set out 
step [1] . It is the conclusion of the chain of steps, the proof of the assumption of the proof 
of the assumption of the Wigner-Bargmann proof. The assumption we use to prove it is 
discussed in the next section. Here is the proof. 

Suppose the system S that we are considering is one of two separate systems 5* and R. 
We consider the larger system composed of the two subsystems 5* and R. We assume there 
can be something else, the other system R, and we assume that it makes no difference. We 
assume the dynamics of S does not depend on anything outside S. Not on what happens 
in R, or on the state of R, or on any correlations of the state of 5* with the state of R. We 
assume that S can be described as part of the larger system of S and R combined, but we 
assume that the dynamics of 5* does not depend on the situation of 5* in the larger system. 

Suppose the state of the larger system of S and R combined is represented by the density 
matrix 

n = p Pl \a)(a\ + (l-p)p2\P)(P\ (7.1) 
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where |a) and \(3) are orthonormal vectors for R that do not depend on the time and, as 
before, pi and pi are density matrices for S and p is a number between and 1. The reduced 
density matrix Tr^IT, which is the density matrix p for S, is described by Eg. (15.11 ). The 
probability (P) for a proposition represented by a projection operator P for S is the sum of 
joint probabilities 

(P) = Tr SR [PU\ =Tr SR [P\a)(a\Tl\ + TT SR [P\(3)((3\Tl\ 

= (P\a)(a\) + (P\P)(P\) = pTr s [Ppi] + (1 -p)Tr s [Pp 2 \. (7.2) 

Suppose a measurement is made on R that distinguishes the states represented by |a) and 
\/3). The probability is p that the result is |a) and 1 — p that the result is \(3). If the result 
is | a), the probability for the proposition represented by P is Tis[Ppi], and if the result is 
the probability for P is Tis[Pp2\- This can be verified experimentally by repeating the 
process of preparing the state represented by n, measuring to distinguish the states of R, 
and testing various propositions for S. The density matrices p\ and p2 describe physically 
distinct possibilities. The times of the events can be changed as long as the measurement on 
R is early enough. The time dependence of pi and p 2 will account for changes in the results. 
The time derivative of the probability for the proposition represented by P is Tis[Pdpi/dt] 
or Tr s[Pdp2/dt] depending on the result of the measurement on R. Altogether, with the 
probabilities for both results being considered, the time derivative of the probability for P 
is 

pTrs[p ir ] + (1 - p)Trs[p ^r ] = Trs[p{p ir + {l - p)d -M )] - (7 - 3) 

The probability (P) for the proposition represented by P is also 

(P) = Tr SR [PU\ = Tr s [PTr R U\ = Tr s [Pp}. (7.4) 

Its time derivative is Tr $ [Pdp/dt]. This is always correct. It may be the only possibility at 
hand. For example, suppose the state of 5* and R combined is represented by the density 
matrix 

IT = pp\a)(a\ + (1 - p)p\(3){(3\ = p[p\a)(a\ + (1 - p)\(3){(3\}. (7.5) 

The dynamics for S must be the same for II as for II. 

The time derivative of the probability for the proposition represented by P is always 
Tr 5 [Pdp/dt]. There are situations where it must also be described by Eq. (l7.3p to fit obser- 
vations of events in a larger system. The dynamics in S can not depend on the situation of 
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S in a larger system. Therefore 

Tr y± ]=TTs[P( /-£ + (!-,,£)]. (7.6) 
From this equality for various projection operators P, we conclude that 

^ =p W + ^- p) W (7 ' 7) 



VIII. THE ASSUMPTION AT THE BOTTOM 

What we need to assume is just that the system we are considering can coexist with an- 
other system without interaction, that the dynamics we are considering can be independent 
of something else in the universe, that the system we are considering can be described as 
part of a larger system without interaction with the rest of the larger system. 

This is like the implicit assumption we make every day when we consider a part of the 
universe and assume the rest has no effect. We take that assumption for granted. Without 
it there would be no physics at all. 

Actually that implicit assumption is stronger than the assumption we make here. We 
implicitly assume that everything outside the considered system has no effect. Here we 
assume only that there is something out there that has no effect. The limit to the validity of 
the implicit assumption comes from the part of the universe outside the considered system 
that has the largest effect. The limit on the validity of the assumption we make here comes 
from the part that has the smallest effect. 



341 ] , but the proof 



The assumption we make here is close in spirit to the idea of envariance 
of Section VII does not use quantum entanglement. The proof does not even require quantum 
mechanics; it could be done in classical mechanics as well. The correlations involved are 
classical. The substance would be the same in classical mechanics; only the language would 
be different. 



Can we prove that quantum dynamics has to be linear? 
Yes, if we assume that the system can coexist with another 
without interaction. 
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